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Abstract 

In a recent article (arXiv:1210.3890), we showed that high multiplicity di-hadron proton-proton 
(p+p) data from the CMS experiment are in excellent agreement with computations in the Color 
Glass Condensate (CGC) Effective Field Theory (EFT). This agreement of the theory with several 
hundred data points provides a non-trivial description of both nearside ("ridge") and away-side 
azimuthal collimations of long range rapidity correlations in p+p collisions. Our prediction in 
arXiv:1210.3890 for proton-lead (p+Pb) collisions is consistent with results from the recent CMS 
P+Pb run at \/^nn — 5.02 TeV for the largest track multiplicity Arrack ~ 40 considered. The 
CMS p+Pb data shows the following striking features: i) a strong dependence of the ridge yield 
on Arrack, with a significantly larger signal than in p+p for the same Arrack) h) a stronger px 
dependence than in p+p for large iV trac k, and iii) a nearside collimation for large Arrack compa- 
rable to the awayside for the lower pr = Pj< lg = p^ soc ' di-hadron windows. We show here that 
these systematic features of the CMS p+Pb di-hadron data are all described by the CGC (with 
parameters fixed by the p+p data) when we extend our prediction in arXiv: 1210.3890 to rarer high 
multiplicity events. We also predict the azimuthally collimated yield for yet unpublished windows 
in the p^ lg ' and p|? soc - matrix. 
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I. INTRODUCTION 



Rapidity separated di-hadron correlations in high multiplicity events at the LHC offer 
sub-femtoscopic scale snapshots of rare configurations constituting the structure of matter in 
the colliding hadrons. A largely unexpected discovery at the LHC by the CMS collaboration 
in high multiplicity N tTac k > HO proton-proton (p+p) events [l[ was a collimation in the 
azimuthal "nearside" separation (A0 w 0) between charged hadrons that have rapidity 
separations 2 < \Ar)\ < 4. For recent reviews on this nearside "ridge" effect, see [2|, y|. 
In ref. Q, we showed that this ridge could be explained by "Glasma graphs" that 
arise in the Color Glass Condensate (CGC) Effective Field Theory (EFT) [sj]. When the 
phase space density of gluons in the proton's wavefunction reach maximal occupancy, or 
saturation, these graphs are significantly enhanced in high multiplicity events relative to 
minimum bias by ct^ 8 , a factor of 10 4 -10 5 for typical values of as- This enhancement is a 
remarkable illustration of how the power counting changes in different dynamical regions of 
the EFT. 

Recently, we extended this study significantly jsj, and showed that a combination of 



saturation |10l . and BFKL dynamics [12|, [13| in the CGC EFT provides an excellent 



description of several hundred data points comprising a matrix (in uniformly spaced windows 
in the di-hadron momenta p^ lg and p^ c ) of the associated di-hadron yield per trigger versus 
A0. A novel feature of this study was the demonstration that BFKL dynamics, which 
generates gluon emissions between the gluons that fragment into triggered hadrons, does an 
excellent job describing the awayside spectra. The description is significantly better than 
PYTHIA-8 [1], and 2 4 QCD graphs in the Quasi-Multi-Regge-Kinematics (QMRK), 
both of which overestimate the awayside yield, especially at larger momenta. 

In ref. 0| , we also made a prediction for the ridge and the awayside collimation in proton- 
lead collisions at y/s NN = 5.02 TeV at the LHC. However, as we shall discuss in detail, the 
prediction corresponded to a value of N tTack ~ 40 for p+Pb collisions. The magnitude of 
the signal is comparable to that in p+p collisions at Arrack ~ 100. Di-hadron data from 



the p+Pb run at the LHC at \/s NN = 5.02 TeV are now available 14j and results are 
available for multiplicities much larger than than those considered in jJ"^These data show 
the following remarkable features, i) They exhibit a strong dependence on the number of 
charged hadron tracks |1 51]. labeled N°^ mc by the CMS collaboration. In particular, it is 
observed that the associated di-hadron yield per trigger in p+Pb is significantly larger than 
the same signal at the same value of N°^ me in p+p collisions, ii) Secondly, they observe 
a distinct px {p 1 ^ ~ p^ c ) dependence of the collimated yield which is peaked around the 
same values of px in both p+p and p+Pb collisions, but drops off much faster in p+Pb with 
increasing p?- iii) Finally, the di-hadron yield, as a function of A0, is nearly as high on 
the nearside as on the awayside for low values of pt, indicating that the long range in Ar) 
awayside di-jet signal is suppressed relative to the Glasma graph contribution. 

In this paper, we will show that all these novel features of the p+Pb data can be ex- 
plained systematically in the Color Glass Condensate framework. The parameters in the 
computations are fixed to be identical to those in our study of p+p collisions in ref. jsj, 
with the exception being the values of the scale Qo(yo) (whose meaning we shall discuss 
further) in the proton and lead nuclei. These are varied at an initial rapidity yo to take into 
account the different geometry of lead nuclei relative to that of the projectile proton. If the 
systematics of the signal were not reproduced by varying the Qo's of proton and lead nuclei, 
there is little freedom left in the framework to vary something else to obtain it. 
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FIG. 1. Anatomy of di-hadron correlations. The glasma graph on the left illustrates its its schematic 
contribution to the double inclusive cross-section (dashed orange curve). On the right is the back- 
to-back graph and the shape of its yield (dashed blue curve). The grey blobs denote emissions all 
the way from beam rapidities to those of the triggered gluons. The solid black curve represents 
the sum of contributions from glasma and back-to-back graphs. The shaded region represents the 
Associated Yield (AY) calculated using the zero-yield-at-minimum (ZYAM) procedure. Figure 
from ref. [9(1. 



The paper is organized as follows. In the next section, we will present the formulae used 
in the computation of Glasma and BFKL graphs. Since all details have been discussed pre- 
viously in [9( and references therein, we will reintroduce them briefly only for completeness, 
our focus here being the understanding of the systematics of the new CMS p+Pb data. In 
section 3, we will discuss in detail results in the CGC, compare these to the data, and make 
predictions for as yet unpublished data. In the final section, we will summarize our con- 
clusions, discuss alternative interpretations and further refinements and tests of the CGC 
framework. 



II. GLASMA AND BFKL CONTRIBUTIONS IN THE CGC EFT 

The collimated correlated two-gluon production Glasma and BFKL graphs are illustrated 
in Fig. (|T]). The collimated contributions from all the Glasma graphs can be compactly 
written as 
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where we have defined 



D i = ® Al (y P , k T)®A 2 (y P ,VT -k T ) [$ A2 (y g ,q T + k T ) + $ A2 (?/ g ,q T -k T )] , 

D 2 = § 2 A , 2 (y q , k T )$ Al (y P , Pt - k T ) [$Ai (v q , <It + k T ) + <£% (y,, qr - k T )] . (2) 

These four terms, called the "single diffractive" and "interference" graphs in [5], constitute 
the leading Pt/Qs behavior. Also included is the next order correction in px/Qs where we 
have[16:] A 1 = 5 2 (p T + q T ) [Z 2 + X| + 23%], such that 

ii= [ ®m (v P , ki±)$A 2 (y q , pt - k 1± ) (k 2 lx ' Pr " k ' ±} 2 2 , 
A 1± kj ± (p T - kuj 

|ki_L x p T | 2 



kf ± (pr - ki ± ) 
(ki_L • pr ~ k 2 ± ) |ki_L x prj 
'ki± k f ± (p r - kij.) 



2 2 = / 3% (y P , ki±)$A 2 (y g , Pt - ki 

2a = / $ J 4 1 (y P ,ki_ L )$A 2 (2/g,PT - ki 
Jk 1± 

The other contribution, A 2 , in Eq. ([!]) can be expressed as 

^2= / $A 1 (2/ P ,ki_ L )$A 1 (l/p,k2_L)$A 2 (2/g,Pr - k 1± )^ A2 (y q ,q T + k 1± ) 
•/k ix 

(kix ■ Pr - k 2 ± ) (k 2± ■ p T - k 2 ± ) + (k 1± x Pr ) (k 2± x p T ) 

2 

k? ± (pr - ki±) 

x (ki± ■ qr - k 2 1± ) (k 2 ± ■ qr - k 2 ± ) + (k 1± x q T ) (k 2± x q r ) 

k 2± (qr + k i± ) 2 

where k 2x = pr — qr — ki x . The above expressions are the result of including all combina- 
torial combinations of graphs represented by the Feynman diagram to the left in Fig. [TJ The 
combinatorics is a result of different ways of averaging over strong color sources between the 
amplitude and complex conjugate amplitude in both projectile and target. 

In these expressions {T^, the only function (besides the one loop running coupling constant 
as) is the unintegrated gluon distribution (UGD) per unit transverse area 

$ A {yM) = 7 ^ k rdr ± r ± J (k ± r ± )[l-T A (y,r ± )} 2 (4) 
2®s Jo 

where 7a is the forward scattering amplitude of a quark-antiquark dipole of transverse 
size rj_ on the target A; it, or equivalently, the UGD, is a univers al q uantity that can 



be determined by solving the Balitsky-Kovchegov (BK) equation [18J, [19| as a function of 
the rapidity y = log (x /x). The forward scattering amplitude TA.(y,f±) at the initial scale 
x = Xq is a dimensionless function of t\Qq, where Qo is a non-perturbative scale at the initial 
rapidity. The saturation scale Qs, defined as the transverse momentum defining the peak 
value of $ on the l.h.s of eq. (]1J), is typically a larger scale even at the initial rapidity, and 
grows rapidly via the BK renormalization group equation with rapidity. In the BK equation, 
different impact parameters in the proton/nuclear target are modeled by varying Qq. The 
minimum-bias (median impact parameter) value we choose for the proton Qq = 0.168 GeV 2 
(corresponding to a Q 2 S ~ 0.7 GeV 2 in the adjoint representation at the initial rapidity), 
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is the value that gives a best fit to deeply inelastic electron-proton scattering data from 



HERA 20 



We now turn to the double inclusive distribution from the back-to-back BFKL graphs 



shown in Fig. [TJ The double inclusive multiplicity can be expressed as |2l|, [22 

d 2 N^ L = 32N c a s (p T )a s (q T ) S ± 
d 2 p T d 2 CL T dy p dy q (2tt) 8 C f p|q| bfkl 



x 



/ / ^A{xi,k 01 _)^ B (x2,k 31 _) Q{k 01 _ - pT,k 3 _L + q T ,y P - y q ) 

Jkn I "/lea I 



where Q is the BFKL Green's function 

"+0O 



1 1 — r +ao / \ 

0^, A,) = (^TTI £ ** L dV ■ («) 



is the BFKL eigen- 
le Gamma function. 



Here C F = (N 2 - 1)/2N C , u(u,n) = -2a s Re + ^) - *(1) 

value, where ^l{z) = dlnT(z)/dz is the logarithmic derivative of t 

Further, we have a s = N c as (^/q a ±qb±) /tt and = arccos j'f^j J ■ 

As shown in Fig. (JTJ), Eq. (E]) gives a collimated A$ contribution exclusively on the 
away side, peaked at A$ = ir, while Eq. ([I]) gives a "dipole" cos(2A$)-like contributions 
with maxima at and n. It's the interplay between these contributions with varying Qq in 
projectile and target that describes the systematics of the proton-proton and proton-lead 
data, that we shall now discuss further. 



III. RESULTS 

As noted, all parameters in Eqs. ([1]) and (EJ are identical to those describing the proton- 
proton data. To simulate the p+Pb collision, all we do is vary Qq in the proton and lead 
nuclei. The proton Ql is varied in multiples of the "minimum bias" value of = 0.168 
GeV 2 to simulate events that probe more central impact parameters in the proton, where the 
gluon density is considerably higher than the gluon density for the median impact parameter 
corresponding to minimum-bias events. Likewise, we define the initial saturation scale in 
lead to be Ql = N™ t ■ 0.168 GeV 2 , where N?^. denotes the number of nucleon participants 
on the lead side. 

For our analysis of the CMS data, we define 

J-2A-y shm Jp™ ln JO z a V a PT v Z / 

The single inclusive gluon distribution in the r.h.s is defined as 0] 

where y s wx — 0.465 is the shift in rapidity in the center-of-mass frame in asymmetrical p+Pb 
collisions towards the lead fragmentation region. In Eq. flTT), the fragmentation functions 
are chosen, as in (9|, to be the NLO KPP parametrization [23j for gluon fragmentation to 
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FIG. 2. The nearside yield per trigger as a function of N°^ mc for 1 < p? < 2, for pr = Pj? g = p^ c - 
Each of the p+Pb curves corresponds to a fixed initial saturation scale in the proton. The trajectory 
of a curve shows how the yield increases with a larger number of participants in the nucleus. 
The initial saturation scale in the Pb nucleus is related to the number of participants through 
Q§(lead) = N^ rt ■ 0.168 GeV 2 . The values of Q§ (proton) = 0.168 - 0.672 GeV 2 (corresponding 
to saturation scales in the adjoint representation of Q 2 S ~ 0.7 — 1.6 GeV 2 ) represent estimates 
these quantities from median ("min. bias") impact parameters in the proton to very central impact 
parameters respectively. 



charged hadrons. Finally, in Eq. (|7|), the transverse overlap area S± is fixed (for Qq = 0.168 
GeV 2 in both protons), from minimum bias proton-proton collisions to give N°^ me = 16, 
the value quoted by CMS This value is subsequently held fixed to determine N°^ mc as 
Ql in both the proton and lead nucleus is varied. 
The double inclusive multiplicity is computed as, 



d 2 N f+ 2A -y B hm 

dr] p dr) q A (r) p , rj q ) (9) 

-2.4-y shift 



dA(f) 
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Bounds on the range of the trigger and associated hadron momenta are denoted respectively 
as p™ m(max ) an d g™ in ( max )_ Likewise, A77 min (Ar7 max ) = 2.0(4.0) denote the pseudo-rapidity 
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FIG. 3. Left: The integrated nearside associated yield per trigger as a function of N°^ me for 
1 < PT < 2. The two curves on which data from |14l j are overlaid are the Q§(proton)=0.336 GeV 2 
and QQ(proton)=0.504 GeV 2 results from Fig. (j2J). Right: The pr {p^ = Pt c ) dependence of the 
associated yield for the same (^(proton) values as the previous plot denoted by green (lower) and 
black (upper) dashed lines, for two different N^ rt ranges. The data here are for N°^ me > 110 that 
is approximated (see Fig. [2]) by the ranges considered. 



gap |24| of hadrons within the experimental acceptance 
A rj q ) = 8 (\r] p - rj q \ - Ar] min ) 9 (Ar? max - \rj p -i] q \). 

The associated yield is computed using the Zero-Yield-at-Minimum (ZYAM) procedure, 




d 2 N 



Assoc. Yield — , 

N ti Jo \ rfA <^ dA 



(10) 



A0 n 



where A0 m ; n . is the angle at which the two particle correlation strength is minimal. An 
important point to note is that the transverse overlap area S± cancels out between the 
numerator and denominator in the r.h.s eliminating a source of uncertainty in di-hadron 
spectra. 

After these preliminaries, we are now ready to discuss our results. In Fig. (j2J), we plot 
the integrated associated nearside yield per trigger (obtained from Eqs. (jUJ) and f lTOj) ) versus 
N°® me as determined in Eq. (J7J). The only inputs are (^(proton) and Qo(lead) = N^ n -0.168 
GeV 2 . We first point out that the prediction in our paper [9J for the p+Pb ridge corresponded 
to N^ Tt = 6 (which we called "central") for the left most curve (with Qo (proton) = 0.168 
GeV 2 ). Clearly, this signal is close in magnitude to the high multiplicity p+p ridge signal, 
if one follows the line of sight of the = 6 grey dashed line. This is similar to the 

observation in Fig. 3 of [14J, where the signal at N°^ me = 60 in p+Pb is comparable to that 
in p+p at Ngg ine = 100. 

What is particularly striking about Fig. (J2J) is the large signal one obtains as one cranks up 
both Ql (proton) and iV?^ in the lead nucleus. As one goes to larger (rarer) values of N°® me , 
one observes that each of the curves grows rapidly. The number of participants on the lead 



side is in line with Monte-Carlo Glauber estimates for not especially rare events [25l . |26 
Interestingly, rarer events are achieved more efficiently by having gluon distributions at 
more central impact parameters in the proton (larger values of Qo (proton)) interact than 
by adding a larger and larger number of participants on the lead side. As is well known, 
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the multiplicity in p+A collisions grows linearly with increasing the saturation scale in the 
proton, but only logarithmically with the saturation scale in the nucleus 271 ]. if the latter 
is the larger of the two [28j. There is also a further effect that when one increases the 
saturation scale in the nucleus, some of the excess multiplicity is pushed out of the detector 
acceptance, this acceptance having wider coverage in the lead nucleus fragmentation region. 
In general, however, one will have a multiplicity distribution with N°^ me generated by a 
number of different impact parameters in the overlap of proton and lead nuclei, and one 
needs to average over the signal on the y-axis of Fig. fl2]) with the appropriate weight for a 
more quantitative analysis. Nevertheless, as seen in the figure, the essential point is that 
there is no problem obtaining a large associated yield for p+Pb collisions at the LHC for 
reasonable values of Qq and N^ t . The reasons for this we will discuss at length in the next 
section. 

In Fig. ([3]), we show comparisons of computations of the integrated nearside associated 
yield per trigger with the CMS data from [14] • In the left plot in this figure, we compare to 
the data the centrality dependence of the associated yield computed for two different values 
of Ql (proton) while varying iVp art on the lead side. These curves are the same as the second 
and third curves (from left) in Fig. ([2]), with the same N^ t values labeling the different 

points. The px distributions (for p^ ls ~ pf? c ) as measured by the CMS collaboration are 
shown in the right plot of Fig. (j3j). Also shown is a compilation of four curves from the 
glasma graph computation obtained by varying the initial saturation scale in the proton 
from Qg(proton) = 0.336 to 0.504 GeV 2 for N^ t =U and 16 or ^ a b rt =12 and 14 respectively. 
These configurations were chosen to be representative of the N°^ mc > 110 centrality class. 
Clearly, as noted, a given ine can correspond to different combinations of configurations 
from the proton and lead side. A more realistic computation would include an average over 
all multiplicities weighted by the corresponding multiplicity distribution. This caveat aside, 
we find that the results in Fig. d3J) reproduce the N°^ me and pt dependence of the associated 
yield rather well. 

It is clear at this point that the glasma graphs are able to account for all of the available 
systematics of the near-side associated yield. We now consider a more differential quantity, 
the correlated yield as a function of the relative azimuthal angle Acj) between di-hadrons 
having momentum p^ lg and pf? c . In order to fully understand the A(j) dependence, there are 
three components one must have under control as summarized in Fig. [TJ firstly, the near- 
side glasma diagrams, which have already been discussed at length. Secondly, one needs to 
include the contribution from mini-jet back-to-back graphs in order to have a quantitative 
picture of the awayside. As mentioned here, and as shown clearly in our previous work 
on high multiplicity p+p collisions, di-jet production with BFKL evolution between the 
triggered particles is the right framework for assessing this quantity. Finally, one needs to 
have control of the underlying event. The underlying event is a true correlation with a 
distinct p^ lg and p^ c structure but no angular dependence. However, we know that there 
are other CGC diagrams that may contribute to the underlying event, which do not 

produce a robust collimation, hence the importance of the ZYAM procedure to remove these 
contributions. We will briefly discuss some interesting characteristics of the underlying event 
within our framework. 

Fig. flU) shows the correlated yield d 2 N/ dA<p for various ranges of p^ lg and p^ c and cen- 



trality classes after performing the ZYAM procedure in each bin [33J. The shaded band 
corresponds to one source of uncertainty in our results from the choice of K factors. The 
curve that appears larger on the awayside correponds to the K factors obtained from our 
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FIG. 4. Correlated yield d 2 N/dA4> after ZYAM as a function of A0 integrated over 2 < |A??| < 4 
for several multiplicity bins, each for a particular range in p^ lg = p|? c . The data points are from 



the CMS collaboration 141 ] . The theoretical curves are the result of adding the glasma and BFKL 
contributions with the band representing the variation in results when changing the K-factors from 
-f^giasma = 1,-^bfkl = 1-1 to -fTgiasma = 1-3,-f^bfki = 0.6. The results for the different multiplicity 
windows correspond (from left to right) to: Qo(proton)=0.168 GeV 2 , Np^ It = 3; (3o(proton)=0.336 
GeV 2 , iV p p a b rt = 6; Q 2 (proton)=0.336 ^ = 12; Q 2 (proton)=0 5Q4 GeV2) N P^ = u . 

Q 2 , (proton) =0.504 GeV 2 , N™ t = 22. Predictions are shown for very large multiplicity windows 
and higher values of p^ lg = p^ c - 
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previous analysis of p+p data (-Kgi aS ma = 1 and ifbfki = 1-1) with the other curve correspond- 
ing to a new choice (fit by eye) of -fQiasma = 1.3 and Kbfki = 0.6. The centrality dependence 
of the result is controlled by our selection of representative values of both Qq (proton) and 
iVp a rt that approximately reproduce the mean multiplicity in each centrality class. 



In Fig. ()5]), we show results for the highest multiplicity events. CMS p+Pb data is only 
available at present for three of these windows diagonal in p^ lg ~ p^ c , shown as the red data 
points. The curves are the sum of the glasma contribution and the BFKL contribution. The 
solid black curve is the result for (proton) = 0.504 GeV 2 on A^ t =14 and the dashed 
green is for (proton) = 0.336 GeV 2 on A^ t =16. As before, we have chosen these values 
to be representative of the N°^ mc > 110 centrality class. A more quantitative result could in 
principle be obtained by the appropriate averaging over various events as discussed earlier. 
The K factors were chosen to coincide with those extracted from our previous analysis of 
p+p collisions (-fQiasma = 1 and i^bfki = l-l). There is no reason why these shouldn't be 



adjusted to p+Pb collisions [34 



The differential associated yields show that the combination of Glasma and BFKL dy- 
namics provides quite a good description of the data without too much fine tuning. The K 
factors for glasma and BFKL graphs used previously for the high multiplicity p+p results 
do a good job in many windows of the matrix in Fig. (j3J) but tend to overpredict the BFKL 
contribution in some of the windows. Reasonable changes in K values, as shown by the 
band, are likely to give a finer tuned description of these ~ 200 data points along with the 
additional several hundred data points of p+p data all with K values of order unity. 



As shown in Fig. (JSJ), the systematics of the away side signal becoming dominated by 
the "dipole"-like Glasma graphs in high multiplicity events is reproduced. It is very non- 
trivial that the BFKL awayside dynamics is suppressed in awayside events such that the 
combination of Glasma+BFKL on the awayside does not overestimate the awayside signal. 
This happens because the BFKL di-jet yield per trigger is very weakly dependent on A part 
since the UGDs in the numerator of Eq. (jSJ) are the same as in the expression for N°^ me in 
Eq. ©• 



Finally, it is interesting to examine the N°^ me dependence of the correlated A0- 
independent yield. Fig. (j5J) (left) demonstrates that the yield depends on N°® me alone 
for differering values of Qo (proton) and N pa , rt , while the collimated signal shown in Fig. ([2]) 
clearly has a more complex structure. In particular, unlike the case of the collimated yield, 
the p+p and p+Pb underlying event contributions lie on the same curve. Also shown in 
Fig. flS} are the data points for Czyam from (TiJ divided by a factor of 5. It is very interesting 
that the data follow the same N°^ me scaling as the glasma graphs. The BFKL contribution 
to the associated yield is shown in Fig. (EJ) (right). It clearly does not have the same N°^ mc 
scaling, and is approximately of the same magnitude as the glasma underl ying event at 

that the c 



100. However, as we noted previously |9j, recent computations |2l|, |35| show 
-independent NLLx contributions are a factor of 2-3 below the LLx contribution 
[36|. Hence, there is ample room for other contributions in the CGC that only produce a 
A0-independent contribution 29M32| as long as they give the same scaling as the glasma 
graphs. 
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FIG. 5. Correlated yield d 2 N/dA<p after ZYAM as a function of \A<j> integrated over 2 < \Arj\ < 4 
for the most central multiplicity bin N°^ mc > 110. The data points are from the CMS 
collaboration [14] and have currently only been provided for the diagonal components 

p^ „ p^c 

of the correlation matrix. The curves are obtained by adding the glasma contributions (K = 1) 
and the BFKL contribution (K = 1.1). The solid black curve is the result for Qq (proton) = 0.504 
GeV 2 on iV~P a b rt =14 and the dashed green is for Qg(proton) = 0.336 GeV 2 on N™=16. 
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FIG. 6. Left: The underlying glasma event per trigger (equivalent to C zyam ) in [14|]) as a function 
of iY t ° r f ine for 1 < Pr lg ,pf c < 2. The large grey squares are the experimental values of Czyam 
divided by a factor 5. All other symbols denote theory computations with a fixed initial scale Qq 
in the proton and varying values of N^. t . Right: The underlying BFKL event per trigger for 
Qq = 0.336 GeV 2 and Qq = 0.504 GeV 2 and varying values of iV part . See text for discussion. 



IV. DISCUSSION AND OUTLOOK 



We showed in the previous section that the CGC EFT gives a good description of the 
novel systematics of proton-lead di-hadron correlations that are long range in rapidity and 
collimated in the azimuthal angle. This is important because an identical analysis previously 
gave very good agreement with the CMS data for di-hadron correlations in high multiplicity 
p+p events. We conclude that the origins of the proton-lead effect are the same as the one 
in proton-proton collisions and unlike nucleus-nucleus collisions, where the systematics of 
the associated yield is dominated by flow Q. A simple but apt analogy that exemplifies 
our conclusion is that a bullet shot through a plane of glass has an interaction cross-section 
closer to the size of the bullet and not that of the glass. 

But what is the deeper origin of the effect ? The systematic features of the comparison 
to data are consistent with the following picture of the proton- lead interaction. As shown 
previously 



on the basis of HERA electron-proton diffractive data, the saturation 
scale in the proton has a strong impact parameter dependence which we have modeled here 
with different values of Qq (proton). In proton-proton collisions, Qo(proton) = 0.168 GeV 2 , 
the value at the median impact parameter is more likely; rare events that correspond to the 
higher Qq which produces the high multiplicity collisions (and the ridge) are very unlikely. 
In contrast, in a proton- lead collision, any given N°^ me has a higher probability to be 
generated by a larger Qq (proton) than the median value. This is because the likelihood that 
gluons at small impact parameters in the proton interact is much larger when the proton is 
scattering off many nucleons along its path, as in a lead nucleus. Such events are more likely 
to dominate the probability Pn for a given A^ t ° r ® me . That would explain why the values of 
the associated yield seen in Fig. ([3]) are more compatible with the larger Qq (proton) values. 



The prior discussion addresses why larger Qq (proton) are more relevant for a given N[ 



offline 
trk 



But it does not explain why the associated yield is so large in p+Pb for any Qq (proton) as 
one varies Q^lead) as shown in Fig. ([2]). The underlying reason is a subtle form of quantum 
entanglement. To simplify the discussion, we will consider only one of the Glasma diagrams 
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FIG. 7. The unintegrated gluon distribution (UGD) of a min. bias proton (Qq (proton) = 0.168 
GeV 2 ) and for lead nuclei for two different values of N pa _ Tt . The former is evolved to rapidity Y=2 
and the latter are evolved to Y=4. The red vertical line kx ~ Q s (proton) denotes the peak of the 
integrand in Eq. (jlip . The remaining angular integral in Eq. (jlip is constrained by kinematics to 
lie within the blue (gray) shaded region for | Pt| = 3 (6) GeV. The collimated signal is determined 
by the curvature in the shaded regions-flat UGDs are uncollimated. 



responsible for the near side collimation evaluated [39] at y p = y q = 0. In this case, the two 
particle correlation is proportional to 

d 2 N ocS ± J d 2 k ± $ 2 A (k T )$ B (\p T - k T |)$ B (|q T - k T |) . (11) 

To ascertain why the above expression yields a signal that is collimated, namely, a larger 
signal for = qr as opposed to the signal when p-r ^ qr, let us consider for simplicity 
|Pt| = \Qt\- This condition, and application of the Cauchy-Schwarz inequality |!o[ leads to 
the condition 

d 2 k ± $ 2 A (k T ) $ B (|p T - k T |) $ B (|qT - k T |) < J d 2 k ± $ 2 A (k T ) $|(|p T - k T |) . (12) 

When the equality holds, there is clearly no collimation because the r.h.s does not depend 
on A(p pq . However, the equality holds if and only if $(|pt + kr|) oc $(|qT + ky|), which is 
only satisfied if the un- integrated gluon distribution is flat within the available phase space. 
Fig. ([7]) clearly shows that the unintegrated gluon distributions are not flat. Therefore, 
on very general grounds, we expect a collimation from the structure of the two particle 
correlation in Eq. (TTTj) . 

Now that we have argued on very general grounds that there must be a collimation, we 
would like to understand the scaling of the yield with N°^ me and iVp art as seen in Fig. fl2]). 
As seen in Fig. ((5]), the underlying event (which characterizes the overall normalization of 
the signal) scales linearly with N°^ mc . Any N°^ me dependence in Fig. ([2]) is therefore a 
consequence of the N°^ me scaling of the normalization. In addition to this N°^ me scaling, 
their is ridge collimation that grows rapidly with A^ part . We will now discuss in turn both 
aspects of the systematics of the observed signal. 
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From the previous discussion, the underlying event has the form 



UF f<Pk ± & A (k T )& B (\pT-k T \) 

UJjj0C fd*k ± <s> A (k T )$ B (\ PT -k T \) ' [i6) 

where the term in the denominator is itself proportional to N°^ me . Because the $'s, as 
shown in Fig. ([7]), are bell shaped curves peaked about the saturation scales, one can deduce 

offline"* 2 OT1 J U™,™ fV, n JJJ? ^ Arofflinc 



by inspection [4lj] that the numerator scales as (N°® ine ) , and hence the UE oc iVj 



trk 



We now address the additional N pa _ rt scaling that is observed only in the collimated 
associated yield (CY). This can be characterized by looking at that ratio of the signal from 
Eq. (fTTj) evaluated at (42j A</> P9 = to that at A(p pq = ir for \p?\ = |qr|, 



rYoc J^ ± $i(k r )$|(| Pr -k r |) 

/d 2 ^i(k T )$ B (|p r -k T |)$ B (|p T + k r |) • 1 J 

The iV t °® me scaling cancels in this ratio. To see simply how the additional collimation arises, 
consider the extreme scenario where the $s are peaked strongly enough to be considered 
Dirac delta distributions. Working within this approximation we can easily perform the inte- 
gral over d 2 k±, whereby <E>^(kr) fixes |kj-| = Qa and the angular integral over $b(\pt — kr|) 
fixes 

Qb ~ Qa ~ Pt 



After making these substitutions [43(, we are left with 



CYoc , « fl W fl > (16) 
*s (yZPr + 2Q\ ~ Q'i) 

The collimated signal is always larger than unity [44| since the maximum of is at Qb- 
As Qb is increased (while keeping px and Qa fixed) the wave function in the denominator is 
probed further away from its maximum leading to a larger collimation. If we make a Gaussian 
approximation for the remaining wavefunctions in Eq. ( TT6|) . we find a rapid growth in the 
collimated signal with A^p art . For the region where Qb > Qa, we find 

CY oc 1 + (Q B - Q A f , (17) 



Q 



which grows as ~ iVp art . 

To summarize the discussion, the behavior of the associated yield is a consequence of the 
quantum entanglement of the wavefunctions of correlated gluons in both the projectile and 
the target. Since two gluons from both projectile and target participate, one obtains the 
overlap of four wavefunctions. Besides energy-momentum constraints on the wavefunctions, 
the signal is sensitive to the detailed structure of these wavefunctions. This includes both 
the density of gluons with varying impact parameter, as well as the pt dependence of the 
gluon distributions for a fixed impact parameter. With the stated simple yet fairly general 
assumptions, the scaling of the collimated yield and the underlying event as a function of 
iV t ° r f lme and iVp art is reproduced. The physics of saturation is absolutely crucial: firstly, on a 
"global" level because the glasma graphs are tremendously enhanced due to the large phase 
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space occupancy of gluons, but equally so because the observed signal is sensitive to detailed 
features of the CGC EFT. 

The theoretical framework employed here can be further improved. An important step 
is to self-consistently include multiple scattering effects alongside the rapidity evolution of 
two gluon production. The framework to do this has been developed but not implemented 
numerically yet |45(. Another improvement is to quantify the NLLx contributions to the 



collimated yield and the underlying event for the kinematics of interest [21|, |35J. Not least, 
the contributions of leading N c multi-gluon correlators and possible pomeron loop effects 
need to be quantified jjjj]. 

Finally, while this work was in preparation, a preprint appeared which interprets the 
effect as due to hydrodynamic flow [46(. As noted previously, we believe the ridge in p+Pb 
collisions to be more analogous to high multiplicity p+p collisions than Pb+Pb collisions: 
the systematics of the study here lends weight to this conclusion. For the p+p case, we 
showed that inclusion of flow j3] changes the structure of the associated yield from that 
observed in the data even for modest flow velocities. While some multiple scattering cannot 
be categorically ruled out, a consistent hydrodynamic description is challenging for systems 
with transverse sizes the order of the proton size because of poor convergence of the gradient 
expansion and the short lifetime of the system. It will be interesting to see whether the 



hydrodynamic description of |46| holds for a wider px and centrality range than shown. In 



this regard, it is important that one include the non-flow jet-like BFKL contribution that 
provides a significant contribution to the awayside yield. 
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